1 GGCACGAGGT CCCCGACGCG CCCCGCCCAA CCCCTACGAT GAAGAGGGCG 50 
1 M K R A 4 



51 TCCGCTGGAG GGAGCCGGCT GCTGGCATGG GTGCTGTGGC TGCAGGCCTG 100 
5SAGG SRL LAW VLWL QAW21 

101 GCAGGTGGCA GCCCCATGCC CAGGTGCCTG CGTATGCTAC AATGAGCCCA 150 
22QVA APCP GAC VCY NEPK38 

151 AGGTGACGAC AAGCTGCCCC CA6CAGGGCC TGCAGGCTGT GCCCGTGGGC 200 
39 VTT SCP QQGL QAV PVG 54 

201 ATCCCTGCTG CCAGCCAGCG CATCTTCCTG CACGGCAACC GCATCTCGCA 250 
55IPAA SQR IFL HGNR ISH71 

251 TGTGCCAGCT GCCAGCTTCC GTGCCTGCCG CAACCTCACC ATCCTGTGGC 300 
72VPA ASFR ACR NLT ILWL88 

301 T6CACTCGAA TGTGCTGGCC CGAATTGATG CGGCTGCCTT CACTGGCCTG 350 
89 HSN VLA RIDA AAF TGL 104 

351 GCCCTCCTGG A6CAGCTGGA CCTCAGCGAT AATGCACAGC TCCGGTCTGT 400 
105ALLE QLD LSD NAQL RSV121 

401 GGACCCTGCC ACATTCCACG GCCTGGGCCG CCTACACACG CTGCACCTGG 450 
122 DPA TFHG LGR LHT LHLD138 

451 ACCGCTGCGG CCTGCAGGAG CTGGGCCCGG GGCTGTTCCG CGGCCTGGCT 500 
139 RCG LQE LGPG LFR GLA 154 

501 GCCCTGCAGT ACCTCTACCT 6CAGGACAAC GCGCTGCAGG CACTGCCTGA 550 
155ALQY LYL QDN ALQA LPD171 

551 TGACACCTTC CGCGACCTGG GCAACCTCAC ACACCTCTTC CTGCACGGCA 600 
172 DTF RDLG NLT HLF LHGN188 

FIG. lA 



601 ACCGCATCTC CAGCGTGCCC GAGCGCGCCT TCCGTGGGCT GCACAGCCTC 650 
189 RIS SVP ERAF RGL HSL 204 

651 GACCGTCTCC TACTGCACCA GAACCGCGTG GCCCATGTGC ACCCGCATGC 700 
205 DRLL LHQ NRV AHVH PHA221 

701 CTTCCGTGAC CTTGGCCGCC TCATGACACT CTATCTGTTT GCCAACAATC 750 
222 FRD LGRL MTL YLF ANNL238 

751 TATCAGCGCT GCCCACTGAG GCCCTGGCCC CCCTGCGTGC CCTGCAGTAC 800 
239 SAL PTE ALAP LRA LQY 254 

801 CTGAGGCTCA ACGACAACCC CTGGGTGTGT GACTGCCGGG CACGCCCACT 850 
255 LRLN DNP WVC DCRA RPL271 

851 CTGGGCCTGG CTGCAGAAGT TCCGCGGCTC CTCCTCCGAG GTGCCCTGCA 900 
272 WAW LQKF RGS SSE VPCS288 

901 GCCTCCCGCA ACGCCTGGCT G6CCGTCACC TCAAACGCCT AGCT6CCAAT 950 
289 LPQ RLA GRDL KRL AAN 304 

951 GACCTGCAGG GCTGCGCTGT GGCCACCGGC CCTTACCATC CCATCTGGAC 1000 
305 DLQG CAV ATG PYHP IWT321 

1001 CGGCAGGGCC ACCGATGAGG AGCCGCTGGG GCTTCCCAAG TGCTGCCAGC 1050 
322 GRA TDEE PLG LPK CCQP338 

1051 CAGATGCCGC TGACAAGGCC TCAGTACTGG AGCCTGGAAG ACCAGCTTCG 1100 
339 DAA DKA SVLE PGR PAS 354 

1101 GCAGGCAATG CGCTGAAGGG ACGCGTGCCG CCCGGTGACA GCCCGCCGGG 1150 
355 AGNA LKG RVP PGDS PPG 371 

1151 CAACGGCTCT GGCCCACGGC ACATCAATGA CTCACCCTTT GGGACTCTGC 1200 
372 MGS GPRH IND SPF GTLP388 

FIG. IB 



1201 CTGGCTCTGC TGAGCCCCCG GCTCACTGCA GTGCGGCCCG AGGGCTCCGA 1250 
389 GSA EPP AHCS AAR GLR 404 

1251 GCCACCAGGT TCCCCACTTC GGGCCCTCGC CGGAGGCCAG GCTGTTCACG 1300 
405 ATRF PTS GPR RRPG CSR421 

1301 CAAGAACCGC ACCCGCAGCC ACTGCCGTCT GGGCCAGGCA GGCAGCGGGG 1350 
422 KNR TRSH CRL GQA GSGG438 

1351 GTGGCGGGAC TGGTGACTCA GAAGGCTCAG GTGCCCTACC CAGCCTCACC 1400 
439 GGT GDS EGSG ALP SLT 454 

1401 TGCAGCCTCA CCCCCCTGGG CCTGGCGCTG GTGCTGTGGA CAGTGCTTGG 1450 
455 CSLT PLG LAL VLWT VLG471 

1451 GCCCTGCTGA CCCCCAGCGG ACACAAGA6C GTGCTCAGCA GCCAGGTGTG 1500 
472 P C * 473 

1501 TGTACATACG GGGTCTCTCT CCACGCCGCC AAGCCAGCCG GGCGGCCGAC 1550 

1551 CCGTGGGGCA GGCCAGGCCA GGTCCTCCCT GATGGACGCC TGCC6CCCGC 1600 

1601 CACCCCCATC TCCACCCCAT CATGTTTACA GGGTTCGGCG GCA6CGTTT6 1650 

1651 TTCCAGAACG CCGCCTCCCA CCCAGATCGC GGTATATAGA GATATGCATT 1700 

1701 TTATTTTACT TGTGTAAAAA TATCGGACGA CGTGGAATAA AGAGCTCTTT 1750 



1751 TCTTAAAAAA AAAAAAAAAA AACTCGA 1777 

FIG. IC 
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